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CESAR MALDONADO 

Abstract. We are interested in time series of the form = x„ + ^„ where {a:„} is 
generated by a chaotic dynamical system and where models observational noise. 
Using concentration inequalities, we derive fluctuation bounds for the auto-covariance 
function, the empirical measure, the kernel density estimator and the correlation di- 
mension evaluated along yo, • • • , 2/n, for all n. The chaotic systems we consider include 
for instance the Henon attractor for Benedicks-Carleson parameters. 



1. Introduction 

Practically all experimental data is corrupted by noise, whence the importance of 
modeling dynamical systems perturbed by some kind of noise. In the literature one finds 
two principal models of noise. On one hand, the dynamical noise model in which the 
noise term evolves within the dynamics (see for instance [2] and references therein) . And 
on the other hand, the so-called observational noise model, in which the perturbation is 
supposed to be generated by the observation process (measurement). In this paper we 
focus on the latter model of noise. 

Suppose that we are given a finite 'sample' yo, . . . , Un-i of a discrete ergodic dynamical 
system perturbed by observational noise. Consider a general observable K{yQ, . . . , yn-i)- 
We are interested in estimating the fluctuations of K and its convergence properties as 
n grows. Our main tool is concentration inequalities. Roughly speaking, concentration 
inequalities allow to systematically quantify the probability of deviation of an observable 
from its expect value, requiring that the observable is smooth enough. The systems for 
which concentration inequalities are available must have some degree of hyperbolicity. 
Indeed, in [7], the authors prove that the class of non-uniformly hyperbolic maps mo- 
deled by Young towers satisfy concentration inequalities. They are either exponential 
or polynomial depending on the tail of the corresponding return-times. Concentration 
inequalities is a recent topic in the study of fluctuations of observables in dynamical 
systems. The reader can consult [S] for a panorama. 

The article is organized as follows. In section 2, we give some general definitions 
concerning observational noise and concentration inequalities. We give some typical ex- 
amples of systems perturbed by observational noise. In section 3, we prove our main 
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theorem, namely, concentration inequalities for observationally perturbed systems (ob- 
served systems). As a consequence, we obtain estimates on the deviation of any sepa- 
rately Lipschitz observable K{yQ, . . . , Hn-i) from its expected value. Section 4 is devoted 
to some applications. We derive a bound for the deviation of the estimator of the auto- 
covariance function in the observed system. We provide an estimate of the convergence 
in probability of the observed empirical measure. We study the convergence of the 
kernel density estimator for a observed system. We also give a result on the variance of 
an estimator of the correlation dimension in the observed system. The observables we 
consider here were studied in [6j and [7] for dynamical systems without observational 
noise. 

2. Generalities 

2.1. Dynamical systems as stochastic process. We consider a dynamical system 
{X, T, fi) where {X, d) is a compact metric space and is a T-invariant probability 
measure. In practice, X is a compact subset of M". 

One may interpret the orbits {x,Tx, . . .) as realizations of the stationary stochastic 
process defined by Xn{x) = T"x. The finite-dimensional marginals of this process are 
the measures Hn given by 

n-l 

(1) dfIn{xo, . . . , Xn-l) = d^l{xo) 5x^=Tx,.i ■ 

i=l 

Therefore, the stochasticity comes only from the initial condition. When the system is 
sufficiently mixing, one may expect that the iterate T^x is more or less independent of 
X if A; is large enough. 

2.2. Observational noise. The noise process is modeled as bounded random variables 
^rt defined on a probability space {0,,B,P) and assuming values in X. Without loss of 
generality, W6 can assume that the random variables are centered, i.e. have expecta- 
tion equal to 0. 

In most cases, the noise is small and it is convenient to represent it by the random 
variables e^j where e > is the amplitude of the noise and is of order one. 

We introduce the following definition. 

Definition 1 (Observed system). For every i G N L) {0} (or i G Z if the map T is 
invertible), we say that the sequence of points {yi} given by 

yi ■.= Tx + eii, 

is a trajectory of the dynamical system [X, T, fi) perturbed by the observational noise 
{(.n) with amplitude e > 0. Hereafter we refer to it simply as the observed system. 

Next, we make the following assumptions on the noise. 
Standing assumption on noise: 

(1) (^n) is independent of Xq and ||.^n|| < 1; 

(2) The random variables are independent. 
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Remark 1. As we shall see, the need not he independent, although it is a natural 
assumption in practice. 

We notice that, under the same assumption on the noise, the authors of ^lOj give a 
consistent algorithm for recovering the unperturbed time series from the sequence {yi\. 
They assume that the process is generated by a sufficiently chaotic dynamical 

system. The merit of Lalley and Nobel (11^) is that a few assumptions are made 
(compare with Kantz-Schreiber's or Abarbanel's books Oil]). In contrast, for the case 
of unbounded noise (e.g. Gaussian) and if the system present strongly homoclinic pairs 
of points, then with positive probability it is impossible to recover the initial condition 
of the true trajectory even observing an infinite sequence with noise (see also [lOj). 

2.3. Examples. 

Example 1. Consider Smale's solenoid map, Ts -.R^ ^ which maps the torus into 
itself: 

Ts ((/>, u, v) = {2(j) mod 27r, j3u + a cos{(j)), Pv + a sin((/))) , 
where < /? < 1/2 and (3 < a < 1/2. Let the random variables S^i be uniformly 
distributed on the solid sphere of radius one. For every vector x = {(j), u, v) in the torus, 
the observed system is given by yi = Ts{xi) + e^j, for some fixed e > 0. 

Example 2. Take (the unit circle) as state space. Let us fix an increasing sequence 
ao < til < • • • < fflfc = (^^d consider for each interval (aj,aj+i) (0 < j < k — 1) 
a monotone map Tj : (aj,aj+i) — >■ S"*^. The map T on is given by T{x) = Tj{x) if 
X G (oj, Oj+i). It is well known that when the map T is uniformly expanding, it admits an 
absolutely continuous invariant measure ^. It is unique under some mixing assumptions. 
Let P be the uniform distribution on S^. The observed sequence is = T*(x) + e,^,. 
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Figure 1. Top: Simulation of the Lozi map for the parameters a=1.7 
and b=0.5. Bottom: A simulation of the observed Lozi map with obser- 
vational noise whose magnitude is bounded by e = 0.06. 
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Example 3. The Lozi map Ti :M? ^ Mp is given by 

Tl{u, v) = {1 — a\u\ + V, bu) , (n, v) G M^. 

For a = 1.7 and b = 0.5 one observes numerically a strange attractor. In |8] the authors 
constructed a SRB measure fi for this map. It is also included in Young's framework 
|13] . Now, as state space of the random variables we take Bi{0), the ball centered at zero 
with radius one. Consider the uniform probability distribution on Bi{0). Let us denote 
by X the vector {u,v) and let e > 0, so, the observed system is given by yi = T\^x + e^j. 

Example 4. Consider the Henon map Th -.M? ^ M? defined as 

Th{u, v) = [l — au^ + bu) , (u, v) G M^. 

Where < a < 2 and b > are some real parameters. The state space of the random 
variables is again Bi{0) with the uniform distribution on it. Let be x = {u,v), then 
the observed system is given by yi = Tj^x + e^j. It is known that there exists a set of 
parameters (a, b) of positive Lebesgue measure for which the map Th has a topologically 
transitive attractor A, furthermore there exists a set A C with Leb(A) > such that 
for all (a, 6) G A the map Th admits a unique SRB measure supported on A f|3] j. 




Figure 2. Top: A simulation of the Henon map for the classical parame- 
ters a=1.4 and b=0.3. Bottom: Simulation of the observed Henon map 
with observational noise whose magnitude is uniformly bounded by e = 
0.04. 



Example 5. The Manneville-Pomeau map is an example of an expansive map, except 
for a point where the slope is equal to 1 (neutral fixed point). Consider X = [0, 1], and 



FLUCTUATION BOUNDS FOR CHAOS PLUS NOISE IN DYNAMICAL SYSTEMS 



5 



for the sake of definiteness take 

'x + 2°x^+" if xe [0,1/2) 



Ta{x) 



2x-l if xe [1/2,1), 



where a E (0, 1) is a parameter. It is well known that there exists an absolutely contin- 
uous invariant probability measure diJ,{x) = h{x)dx and h{x) ~ when x — )• 0. The 
observed sequence is defined by yi = T^{x) The random variables are uniformly 

distributed in X . One identifies the [0, 1] with the unit circle to avoid leaks. 

2.4. Concentration inequalities. Let X be a metric space. For any function of n 
variables K : — t- M, and for each j, < j < n — 1, let 

\K{xo,...,Xj,...,Xn-l) - K{xo,...,x',,...,Xn-l)\ 

Lipj{K) := sup sup . 

We say that K is separately Lipschitz if, for all < j < n — 1, Lipj(i^) is finite. 

Now, we may state the fohowing definition. 

Definition 2. The stochastic process (Yn) taking values on X satisfies an exponential 
concentration inequality if there exists a constant C > such that, for any separately 
Lipschitz function K of n variables, one has 

(2) E |^e^(^o,-,>'n-i)-E(X(yo,...,y„_i))^ < E"Jo Lip, W . 
Notice that the constant C depends only on T, but neither on K nor on n. 

A weaker inequality is given by the following definition. 

Definition 3. The stochastic process (Yn) taking values on X satisfies a polynomial 
concentration inequality with moment q > 2 if there exists a constant Cq > such that, 
for any separately Lipschitz function K of n variables, one has 

q/2 

(3) E {\K{Yo, . . . , y„_i) - E{K{Yo, Y^^^^ < C J ^ Upj{Kf ' 

\j=o 

As in the previous definition the constant Cq does not depend neither on K nor on n. 

Remark 2. When q = 2, we have a bound for the variance of K{Yq, . . . , Yn). 

Remark 3. // (Yn) is a bounded i.i.d. process then it satisfies ^ (see e.g. It 
also satisfies ^ for all q > 2, see e.g. [3] for more details. 

These concentration inequalities allow us to obtain estimates on the deviation proba- 
bilities of the observable K from its expected value. 

Proposition 1. If the process (Yn) satisfies the exponential concentration inequality ([2]) 
then for all t > and for all n > 1, 

(4) F{\K{Yo, . . . , y„_i) - EiKiYo, . . . , y„_i))| > t} < 2e*^^"-"o ^^p.(^)^ . 
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// the process satisfies the polynomial concentration inequality ([3]) for some q > 2, then 
we have that for all t > and for all n> 1, 

(5) P{|i^(yo,••.,>;-l)-E(K(Fo,••.,>;^-l))| >t}< ^(5^Lip^W')'^'. 

3=0 

The inequality Q follows from the basic inequality P(Z > t) < e^'*'*E(e^^) with 
A > applied to Z = K(Yo, . . . ,Yn-i) — ¥,{K(Yo, . . . ,Yn-i)), using the exponential 
concentration inequality ^ and optimizing over A. The inequality ([s]) follows easily 
from Q and the Markov inequality (see [5j for details). 

It has been proven that a dynamical system modeled by a Young tower with expo- 
nential tails satisfies the exponential concentration inequality [7j. The systems in the 
examples from [T] to |4] are included in that framework. The example [5] satisfies the poly- 
nomial concentration inequality with moment g < ^ — 2 for a G (0, 1/2), which is the 
parameter of the map (see for full details) . 



3. Main theorem & corollary 



Let us first introduce some notations. We recall that P is the common distribution of 
the random variables ^j. The expected value with respect to a measure v is denoted by 
Ely. Recall the expression ^ for the measure ^n- Hence in particular 



^tini^) = J ■■■J K{xo, ■ ■ ■ ,Xn-l)dfin{xo, . . ■ ,Xn-l) 

= /A-,.....,r»-..)d,.(.). 

Next, we denote by fin'S'P" the product of the measures fin and P", where stands 
for P (g) • • • (g) P (n times). The expected value of K{yo, . . . , yn-i) is denoted by 

E^n^P-W := / K{x + e^o,---,T^'^x + ein-i)Mx)dP{io)---dP{in-i). 

Our main result is the following. 

Theorem 1. If the original system {X,T,fi) satisfies the exponential inequality then 
the observed system satisfies an exponential concentration inequality. For any n > 1, it 
is given by 

(6) IE^„®P" (^e^(S/0,-,?/n-l)-E^„«pn(i^{yo,...,?/n-l))^ < g^(l+£' ) E"Jo LiP, C-?^)" ^ 

Furthermore, if the system {X, T, fi) satisfies the polynomial concentration inequality 
([3]) with moment q > 2, then the observed system satisfies a polynomial concentration 
inequality with the same moment. For any n>l, it is given by 

(7) 

Em„®P" {\K{yo,.. . ,2/„_i) -E^„^pn(i^(yo,.. . ,yn-i))r) < D,(l + e)'?( ^ Lip^(i^)')'' . 

i=o 

Observe that one recovers the corresponding concentration inequalities for the original 
dynamical system when e vanishes. 
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Remark 4. Our proof works provided the noise process satisfies a concentration inequal- 
ity (see Remark^. We have stated the result in the particular case ofi.i.d. noise because 
it is reasonable to model the observational perturbations in this manner. Nevertheless, 
one can slightly modify the proof to get the result valid for correlated perturbations. 

Proof of theorem^ First let us fix the noise {^j} and let ^ := (^Oi'^i) • • • ,Cn-i)- Intro- 
duce the auxiliary observable 

K^{xo, . . .,Xn-i) := K{xo + e^o, • • ■,Xn-l + e^n-i)- 

Since the noise is fixed, it is easy to see that Lip j{K-^) = Lipj{K) for all j. 

Notice that k^{x, . . . .T^-^x) = K{x + eCo, ■ ■ . ,T'^'^x + £(,n-i) = K{yo,.. . ,2/„-i). Next 
we define the observable -F(^0; ■ ■ ■ , Cn-i) of n variables on the noise, as follows, 

F(Co, . . . , Cn-i) := IEmJ^J • • • > T'^-'x)). 
Observe that. Lip j{F) < eLip j{K). 

Now we prove inequality ([6]). Observe that is equivalent to prove the inequality for 



E 



Adding and subtracting E^^^^pn [K-^{x, . . . , T" ^x)) and using the independence between 
the noise and the dynamical system, we obtain that the expression above is equal to 

E^ (^eK^i^^...,T"--^x)-E^„iK^{x,...,T"-^x))\^ jg^^ J^gF(5o,...,€n-i)-Epn(F(5o,...,Sn-i)) 

Since in particular, i.i.d. bounded processes satisfy the exponential concentration in- 
equality (see Remark [3] above), we may apply ^ to the dynamical system and the 
noise, yielding 



E 



I^^K^{x,...,T"'''x)-E^„{K^{x,...,T"-^x))'^ J^gFKo,-,Cn-i)-Epn (F(^0,-,€n-l))^ 

where D := max{C, C'}. 

Next, we prove inequality ([T]) similarly. We use the binomial expansion after the triangle 
inequality with K^^(K-^{x, . . . ,T"~^x)). Using the independence between the noise and 
the dynamics, we get 

^/in(8)P"{\K{yo, ■ ■ -^yn-l) - E^„cg,pn(i^(yo, . . .,yn-lW) 



(8) ^ E ( p • • ■ , T^-'x) - E,„iK^ix, . . . , T--'x))nx 

Epn (|F(^o, • • . , ^n-l) - Epn(F(eo, . . . , ^n-lW-") . 

We proceed carefully using the polynomial concentration inequality. The terms corre- 
sponding to p = 1 and p = q — 1 have to be treated separately. For the rest we obtain 
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1-p 
2\ 2 



the bound 

q , s n—1 ,„ n — 1 

E U)^.(ELip.(^)T x^^-.(-'ELip.(i^) 

p=o j=o i=o 

For the case p = 1, we apply Cauchy-Schwarz inequahty and ([3]) for (7 = 2 to get 

If g = 2, we proceed in the same way for the second factor in the right hand side of 
^. The case p = q — lis treated similarly. Finally, putting this together and choosing 
adequately the constant Dq we obtain the desired bound. □ 

Next we obtain an estimate of deviation probability of the observable K from its 
expected value. 

Corollary 1. If the system {X,T,n) satisfies the exponential concentration inequality, 

then for the observed system {yi}, for every t > and for any n>l we have, 

(9) 



/x„®P"(|i^(yo,...,2/n-i)-E^„®P"(i^)| >t) <2exp 



-t" 



4D(l + e2)^-iLip,.(i^)2^ 



// the system (X, T, /i) satisfies the polynomial concentration inequality with moment 
q >2, then the observed system satisfies for every t > and for any n > 1, 



1 \ 9/2 

n—1 * 



(10) ^^n^P^{\K{yo,...,yn~l)-E^.^^P-{K)\>t) < ^{l + eY | Y.^iv,{K) 



ti 

The proof is straightforward and left to the reader. 



4. Applications 

4.1. Dynamical systems. Concentration inequalities are available for the class of non- 
uniformly hyperbolic dynamical systems modeled by Young towers ([Z])- Actually, sys- 
tems with exponential tails satisfy an exponential concentration inequality and if the tails 
are polynomial then the system satisfies a polynomial concentration inequality. The ex- 
amples given in section 2 are included in that class of dynamical systems. We refer the 
interested reader to |13j and |14j for more details on systems modeled by Young towers. 
Here we consider dynamical systems satisfying either the exponential or the polynomial 
concentration inequality. We apply our result of concentration in the setting of observed 
systems to empirical estimators of the auto-covariance function, the empirical measure, 
the kernel density estimator and the correlation dimension. 
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4.2. Auto-covariance function. Consider the dynamical system {X, T, fx) and a square 
integrable observable / : X — t- M. Assume that / is such that J /d// = 0. We remind 
that the auto-covariance function of / is given by 

Cov(fc) := j /(x)/(r'=x)d/i(x). 

In practice, one has a finite number of iterates of some /x-typical initial condition x, thus, 
what we may easily obtain from the data is the empirical estimator of the auto-covariance 
function: 

^ n— 1 

C^Ynik) ■.= -y^f{rx)f{T+^x). 
n ^-^ 

2=0 

Prom Birkhofi^'s ergodic theorem it follows that Cov(A;) = lim„_s.oo Cov„(A:) ^u-almost 
surely. Observe that the expected value of the estimator Cov„(A;) is exactly Cov(A;). 

The following result gives us a priori theoretical bounds to the fluctuations of the 
estimator Cov„ around Gov for every n. This result can be found in [7], here we include 
it for the sake of completeness. 

Proposition 2. Let Cov{k) and Cov„(A;) be defined as above. If the dynamical system 
{X, T, fi) satisfies the exponential concentration inequality ^ then for all t > and any 
integer n > 1 we have 



Covn(fc) — Cov(A:) 



> t < 2 exp 



l6Caj \n + k 



where aj = Lip(/) ||/||oo and C is the constant appearing in 

If the system satisfies the polynomial concentration inequality ([s]) with moment q > 2, 
then for all t > and any integer n > 1 we have 



>t] <C,^ ^ 



Covn{k) — Cov{k) 

where Cq is the constant appearing in ([s]). 

Proof. Consider the following observable of n + /c variables, 



2a/ y (n + k'^"'^ 



K{zo, Zn+k-l) ■=-yZ fi2i)fizi+k)- 

i=0 

In order to estimate the Lipschitz constant of K, consider 0</<n + fc — 1 and 
replace the value zi with z'^. Note that the absolute value of the difference between 
K{zq, . . . , zi, . . . , Zn+k-i) and K{zq., . . . , z'l, . . . , Zn+k-i) is less than or equal to 

^ \ f{zi-k)f{zi) + f{zi)f{zi+k) - f{zi-k)f{z'i) - f{z'i)f{zi+k)\ , 
and so for every index I, we have that 

UMK)< sup ...L^MM^imikt!^^ 

zo,...,z^+k-izi=^zin d{zi,Zi) n 
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Next, if the exponential inequality holds, we use Q to obtain 



H (CoYn{k) — Cov(A;) > t) < exp 



( 



n 



n-\- k 



Applying similarly the inequality to the function —K, we get the result by a union 
bound. The polynomial case follows from inequality ([s]). □ 

4.2.1. Auto-covariance function for observed systems. Let us consider the observed orbit 
Uo, . . . , Un-i- Define the observed empirical estimator of the auto-covariance function as 
follows 

^ n—1 

(11) cSv„(A;):=-V/(y,)/(y^+fc). 

n ^-^ 



i=0 



We are interested in quantifying the influence of noise on the correlation. We provide a 
bound on the probability of the deviation of the observed empirical estimator from the 
covariance function. 



Theorem 2. Let Cov„(A;) be given by (11). If the dynamical system {X,T,fj,) satisfies 
the exponential inequality Q then for all t > and for any integer n > 1 we have 



CoVn{k) — Cov(/c) 



> t + 2a/e] < 2 exp 



64L>a2(l + £2) \n + k 



n 



+2 exp 



16Ca2 \n + k 



n 



where aj = Lip(/)||/||oo, C and D are the constants appearing in ^ and ^ respectively. 
If the system satisfies the polynomial inequality with moment q > 2, then for all t > 
and any integer n > 1 we have 



Hn®P''[ Covnik) - Cov{k) > t + 2a/ej < {2'^Dg{l + eY + Cg 

where Cg and Dg are the constants appearing in ([s]) and ([T]) respectively. 
Proof. To prove this assertion we will use an estimate of 



2ajY /n + fc^''/^ 



Cov„(/c) - Cov„(A:) 



>t + E 



Cov„(A;) — Cov„(A;) 



First let us write Xi := T^x, and observe that by adding and subtracting f{xi + 
e^i)f{xij^k)i the quantity |Cov„(A;) — Cov„(A;)| is less than or equal to 

^ n— 1 

- V! \f{xi + e(i)[f{xi+k + e^i+k) - f{xi+k)] + [f{xi + e^i) - f{xi)]f{xi+k)\ , 



i=0 



which leads us to the following estimate. 



(12) 



E 



Cov„(fc) - Cov„(A:) <2eLip(/)||/|U. 
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For a given realization of the noise {ej}, consider the fohowing observable of n + k 
variables 

n ^-^ 

i=0 

For every < Z < n — 1, one can easily obtain that 

LipiiK) < ^Lip(/)||/||oo. 
n 

In the exponential case, from the inequality ([9]) and the bound (12) on the expected 
value of i^T, we obtain that 



CoVn{k) — CoVn{k) 



>t + 2sa,j<2exp I g^^..^, ^ 



Using proposition [2| a union bound and an adequate rescaling, we get the result. In 



order to prove the polynomial inequality, proceed similarly applying (10). □ 



4.3. Empirical measure. The empirical measure of a sample xq, . . . , Xn-i is given by 

where 5x denotes the Dirac measure at x. If the given sample xq, . . . , Xn-i is the sequence 
X, . . . , T^~^x for a /i-typical x G X, then from Birkhoff 's ergodic theorem it follows that 
the sequence of random measures {£n} converges weakly to the T-invariant measure /i, 
almost surely. 

Consider the observed itinerary yo, . . . , Un-i and define the observed empirical measure 

by 

^ n—l 

Observe that this measure is well defined on X. Again Birkhoff's ergodic theorem implies 
that almost surely 

n—l 

hm -y^g{y,)= [ [ ^(x + e)dMx)dP(e), 

i=0 

for every continuous function g. More precisely, this convergence holds for a set of 
measure one of initial conditions for the dynamical system (X, T) and a set of measure 
one of noise realizations (^i) with respect to the product measure P^. 

We want to estimate the speed of convergence of the observed empirical measure. 
For that purpose, we chose the Kantorovich distance on the set of probability measures, 
which is defined by 



n{p, v) := sup / gdfi - / gdiy, 

qGC J J 



where fi and are two probability measures on X and C denotes the space of all real- 
valued Lipschitz functions on X with Lipschitz constant at most one. 
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Now, we study the fluctuations of the Kantorovich distance of the observed empirical 
measure to the measure /i, around its expected value. The statement is the following. 

Proposition 3. // the system {X, T, fi) satisfies the exponential concentration inequality 
then for all t > and any integer n > 1, 



If the system satisfies the polynomial concentration inequality ^ with moment q > 2, 
then for all t > and any integer n > 1, 

Using the following separately Lipschitz function of n variables, 



K{zo, . . .,Zn-i) := sup 



^ n — 1 

n ^-^ ' 

j=0 



It is easy to check that Lipj(A') < ^, for every j = 0, . . . , n — 1. The proposition follows 
from the concentration inequalities ([9| and (10). 

We are not able to obtain a sufficiently good estimate of E^^^pn in di- 

mension larger than one, thus in the following we restrict ourselves to systems with 
X C M. 

Lemma 1 ([6j). Let {X,T,iJ,) be a dynamical system with X G M. If there exists a 
constant c > such that for every Lipschitz function f , the auto-covariance function 
Covf{k) satisfies that X]fcLi|C'OVj(A;)| < c||/||Lip, then there exists a constant B such 
that for all n > 1 

The proof of the preceding lemma is found in [HI Section 5]. It relies in the fact that 
in dimension one, it is possible to rewrite the Kantorovich distance using distribution 
functions. Then by an adequate Lipschitz approximation of the distribution function, the 
estimate bound follows from the summability condition on the auto-covariance function. 

As a consequence of proposition [3] and the previous lemma, we obtain the following 
result. 

Theorem 3. Assume that the system {X,T,n) satisfies the assumptions oflemma^ Let 
£n he the observed empirical measure. If the system satisfies the exponential inequality 
([2]) then for all t > and for all n > 1 we have that 

( _ t^B \ -^Vs 

If the system satisfies the polynomial inequality ([s]) with moment q > 2, then for all 
t > and for all n > 1 we obtain 
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Proof. Clearly E^„0pn (/{(£:„, ^)) < E^,„0pn(K(£:„,f„)) +E^„0pn(K(£:„,/x)). A straightfor- 
ward estimation yields 



^,i„(S)P"{l^{£n,£n)) < / sup 

gee 



n~l 



i=0 



d//„(g)P" < e. 



We obviously have E^^^pn {n^Sn, fJ-)) = E^^ {K{£n,fJ'))- Using the exponential estimate 
of proposition [3] and lemma [T] we obtain, for any t > 0, 



n 



1/4 



< exp 



Rescaling adequately we get the result. For the polynomial case, one uses the polynomial 
estimate of proposition [3| □ 

4.4. Kernel density estimator for one-dimensional maps. In this section we con- 
sider the system {X, T, fi) where X is a bounded subset of M. We assume the measure fi 
to be absolutely continuous with density h. For a given trajectory of a randomly chosen 
initial condition x (according to /u), the empirical density estimator is defined by, 



hnix;s) :-- 



1 



n-1 

E 

j=0 



T^x 



dr. 



where a„ — ^ and na„ — t- 00 as n diverges. The kernel "0 is a bounded and non-negative 
Lipschitz function with bounded support and it satisfies J = 1. We shall use the 

following hypothesis. 

Hypothesis 1. The probability density h satisfies 

j \h{s) - h{s-a)\ds < C'\af 

for some constants C > and /3 > and for every o" G M. 

This assumption is indeed valid for maps on the interval satisfying the axioms of 
Young towers with exponential tails (see [6l Appendix C]). For convenience, we present 
the following result on the convergence of the density estimator ([7]). 

Proposition 4. Let ip be a kernel defined as above. If the system (X, T, /x) satisfies the 
exponential concentration inequality ^ and the hypothesisl^ then there exist a constant 

> such that for any integer n > 1 and every t > + ^^,2 ^ , we have 



hnix; s) - h{s) 



ds>t] < e *cupW^ . 



Under the same conditions above, if the system satisfies the polynomial concentration 
inequality ^ for some q >2, then for any integer n > 1 and every t > (^On + 
we obtain, 



t^ 



hn{x; s) - h{s) 



ds > t < 



The parameter (3 is the same constant appearing in the hypothesis^ 
For the proof of this statement see [[Tj or Theorem 6.1 in [6]. 



14 



CESAR MALDONADO 



4.4.1. Kernel density estimator for observed maps on the circle. In order to avoid 'leak- 
ing' problems, now we assume X = S^. Given the observed sequence {yj}, let us define 
the observed empirical density estimator by 



hn{yo,---,yn-i;s) := — *— (- — —] 

nan ^ \ an J 



Our result is the following. 



Theorem 4. If {X, T, /x) satisfies the hypothesis [7] and the exponential concentration 
inequality, then there exist 
and for any integer n > 1, 



inequality, then there exists a constant > such that, for all t > ( an + -j=^ 



fln^P''' 



hn{yo, ■ ■ ■,yn-i; s) - h{s) ds>t + Lip(V')^ < exp 



where R := 4DLip(-i/')^. 

If the system satisfies the hypothesisUl and the polynomial concentration inequality, then 
for all t> C, U + -^)an,i ,orany ml J. n > 1, ..e Ha,e 



fln^P"" 



hniyo, yn-i;s) - h{s) ds>t + Lip(^)4 ] <dJ^^^ ^)Lip(V') ^ 



The parameter f3 is the same constant appearing as in the hypothesis\^ 
Proof Consider the following observable of n variables, 



K{zo, . . .,Zn~l 



^ ' / nan ^ ^ ( «n ) 



h{s] 



ds. 



It is straightforward to obtain that Lipi{K) < ^^^^^f^ , for every I = 0, ... ,n — 1. Next, 
we need to give an upper bound for the expected value of the observable K, first 

n-l 



E 



iK)< 



1 



na, 



E 

j=0 



On 
n-l 



S X j 



ar, 



h{s) 



dsJd//„®P" 
ds]dnn- 



Subsequently we proceed on each part. For the first one we get 



— E 

nan ^ 



3=0 



< 



an 

n-l 



ds ] dfLn (8) P'" 



Lip(^)e 



— E 

nan ■ „ an 

3=0 



d^„(g)P" < Lip(V')^. 



at 
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For the second part, there exist some constant such that 

/ ( / I i E K^) - M»)h)dM. < C, (aS + ^) . 

The proof of this statement is found in |6l Section 6]. We finish the proof applying 1^ 
and (10), respectively. □ 

4.5. Correlation dimension. The correlation dimension dc = ddfJ-) of the measure fi 
is defined by 

^ ^ log J f^{Br{x))dfi{x) 
r\o logr ' 

provided the limit exists. We denote by Corr(r) the spatial correlation integral which is 
defined by 

Corr(r) = j ^{Br{x))d^{x). 
As empirical estimator of Corr(r) we choose the following function of n variables 

Kn,r[xo, Xn-l) := ^ H{r - d{Xi, Xj)), 

where H is the Heaviside function. It has been proved (see e.g. |12j ) that 

Corr(r)= lim ^(x, . . . , T^-^x), 

n— >oo ' 

;U-almost surely at the continuity points of Corr(r). Next, given a /i-typical initial 
condition, let us consider the observed sequence yo, . . ■ ,yn-i, and define the estimator 
of Corr(r) for observed systems, as follows 

Kn,r{yo,---,yn-i) ■= ^ H {v - d{yi, yj)) . 

Since Kn,r{yo, ■ ■ ■ ,yn-i) is not a Lipschitz function we cannot apply directly concen- 
tration inequalities. The usual trick is to replace -ff by a Lipschitz continuous function 
(j) and then define the new estimator 

(13) KtAvo, . . . , yn-i) := E ^ (l - • 

The result of this section is the following estimate on the variance of the estimator 

Theorem 5. Let (p be a Lipschitz continuous function. Consider the observed trajectory 
yo, ■ ■ ■ ,yn-i and the function Kt,T{yoi ■■■ ,yn-i) given by (13). If the system {X,T,fi) 
satisfies the polynomial concentration inequality with q = 2, then for any integer n > 1, 



Var(i^^_,.) < L>2Lip(0)2(l + 
where Var(y) := E(y^) — E(y)^ is the variance ofY 



1 
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The proof follows the lines of section 4 in [6] , and by applying the inequality ([T]) with 
g = 2 and noticing that Lipi{Kt,r) < ^^r^'^ every / = 0, . . . , n — 1. 
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